Experimental requirements for Grover's algorithm in optical quantum computation 
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The field of linear optical quantum computation (LOQC) will soon need a repertoire of experi- 
mental milestones. We make progress in this direction by describing several experiments based on 
Grover's algorithm. These experiments range from a relatively simple implementation using only a 
single non-scalable CNOT gate to the most complex, requiring two concatenated scalable CNOT gates, 
and thus form a useful set of early milestones for LOQC. We also give a complete description of 
basic LOQC using polarization-encoded qubits, making use of many simplifications to the original 
scheme of Knill, Laflamme, and Milburn 



I. INTRODUCTION 

In the next few years, we can expect to see demonstra- 
tions of basic quantum gates in several implementations 
of quantum computation. With this in sight, it is natural 
to look ahead to what interesting quantum circuits can be 
built out of a small number of one- and two-qubit gates 
acting on a few qubits, as these circuits will provide mile- 
stones on the way to full-scale quantum computation . 

Grover's search algorithm 0,0 is a good candidate for 
such a milestone. It is a quantum algorithm identifying 
one of N elements, marked by an oracle, with order \ N 
uses of the oracle. When the search space consists of 
4 elements, the algorithm is guaranteed to produce the 
marked element after one use of the oracle, compared to 
the 2.25 uses expected in a classical search. We will see 
that it can be implemented using only 7 one-qubit gates 
and 2 two-qubit gates, which makes it an excellent tar- 
get once one- and two-qubit gates have been mastered. 
Not surprisingly, it was one of the first algorithms to 
be experimentally implemented in nuclear magnetic res- 
onance quantum computing (Chuang, Gershenfeld, and 
Kubinec p| and Jones, Mosca, and Hansen Jj|). 

A promising quantum computing technology is the 
linear optical quantum computation (LOQC) scheme of 
Knill, Laflamme, and Milburn p| (see Gottesman, Ki- 
taev, and Preskill |7| for an alternative approach). In 
this scheme, one-qubit gates are relatively straightfor- 
ward. While implementing a scalable universal two-qubit 
gate such as a CNOT remains a challenge, such a gate is 
likely to be demonstrated in the next couple of years. 
Already, a non-scalable CNOT has been approximately 
implemented by Pittman et al. [8j- For these reasons, it 
is important to establish some specific LOQC milestones 
on the path toward building a large quantum computer, 
in the form of some simple algorithms on a few qubits. 

This pursuit is dogged by conceptual difficulties asso- 
ciated with quantum algorithms on a very small number 



of qubits, summed up in the question: What is the crite- 
rion for "quantumness" ? A reasonable criterion, partic- 
ularly in the context of Grover's algorithm, is to require 
a "speed-up" over the best classical algorithm. However, 
this notion can be hard to make sense of when the number 
of steps is on the order of ten, rather than millions, and 
the problem can easily be done by hand (not to mention 
by a GHz classical processor). Furthermore, sometimes 
the reduction in the number of steps can be achieved in 
an implementation whose physical requirements grow ex- 
ponentially with the number of qubits, trading off time 
for space. The question of whether or not this counts as 
"quantum" has received much attention (see, for exam- 
ple, Kwiat et al. , Bhattacharya, van den Heuvell, and 
Spreeuw [lfij). 

Perhaps the best solution to this problem is a prag- 
matic one. In the quest to build a quantum computer 
large enough to provide a genuine advantage over classi- 
cal computers, two things must be achieved. First, a fine 
level of quantum control must be demonstrated for both 
single qubits and pairs of qubits. Second, it will be nec- 
essary to show that the number of components (qubits 
and gates) in a circuit can be increased without insur- 
mountable increases in difficulty. In particular, we must 
avoid exponential increases in the amount of resource us- 
age (either time or space) — the implementation must 
be scalable 1 . 

Therefore, the importance of an experimental achieve- 
ment of an early milestone (such as the four-element 
Grover's algorithm) should be measured primarily on 
these criteria. A demonstration that Grover's algorithm 
finds the marked item in fewer steps than is possible with 
a classical computer is an important goal, but it is less 
important than the fine level of quantum control that 
it implies. At this early stage of development of quan- 
tum computers, any such demonstration is a significant 
achievement, while a demonstration of such control in a 
scalable manner is likely to be significantly more difficult 
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1 Blume-Kohout, Caves, and Deutsch llll give a general charac- 
terization of the requirements for scalability. 
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and consequently more impressive. 

This is illustrated by the experiment of Kwiat et al. 9] , 
which demonstrated the ability to implement the search 
algorithm in a quantum optical system, but using an 
encoding that is not scalable — as they point out, the 
number of optical elements that they require grows expo- 
nentially in the number of qubits in their system. Thus, 
although their techniques might be successfully extended 
to a few qubits, they are not practical as the basis for an 
approach to building a quantum computer. 

In contrast, we are explicitly concerned with devel- 
oping experimental milestones on the path toward full- 
scale quantum computation in optical systems. We show 
that Grover's algorithm on four elements provides sev- 
eral experiments that gradually increase in complexity. 
The simplest version requires little more than a single, 
coincidence-basis CNOT gate together with a source of 
entangled photon pairs, while the most complex version 
requires two scalable CNOT gates and six photons. 

Before describing these experiments and their require- 
ments, we give a brief description of Grover's algorithm 
(Section [DJ and LOQC (Section Since the orig- 

inal proposal of LOQC, there have been many simpli- 
fications and improvements to the scheme. We give a 
concise description of the basics of LOQC making full 
use of these simplifications, focusing on a variant of the 
original scheme that uses polarization-encoded qubits. In 
Sections llVl and IVl we describe and compare several opti- 
cal circuits, all implementing Grover's algorithm on four 
elements. In Section IVII we briefly discuss appropriate 
figures of merit for Grover's algorithm, and we conclude 
in Section IVTTl 



II. GROVER'S ALGORITHM ON FOUR 
ELEMENTS 

Grover's algorithm 0, (see also Nielsen and 
Chuang |12| | for an elementary treatment on which much 
of this section is based) is a quantum algorithm that can 
speed up the solution to certain types of oracle-based 
computations. We will say more about oracles and their 
implementation after describing Grover's algorithm. 



A. Grover's algorithm 

Suppose our search space consists of TV = 2™ elements, 
of which one is a solution to a given problem. Grover's 
algorithm identifies the solution (with high probability) 
using n + 1 qubits according to the following algorithm: 

1. Prepare the state | o> 1 1> - 

2. Apply R® n+1 , where R = -± [\ \] is the one- 

qubit Hadamard gate. (We use the symbol R in- 
stead of the usual H to avoid confusion with the 
horizontal polarization state.) 
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FIG. 1: A circuit diagram for the four-element Grover algo- 
rithm, based on the figure in Box 6.1 of fl^l . The top two 
qubits are the data qubits, initialized in state |0)|0), while 
the bottom qubit is the ancilla qubit, initialized in state |1). 
The boxes labeled R and Z represent the one-qubit Hadamard 
and Pauli az gates, respectively. The CNOT is denoted by the 
usual symbol, while the gray half-circles represent one-qubit 
measurements in the computational basis, whose output ap- 
pears on the classical output wires (double lines) . The final X 
gate represents the classical NOT required to put the output 
into the correct form. The measurement always gives "1" on 
the ancilla qubit, while the data qubits give "a" and "6". It 
is straightforward to show that, in principle, ab is the state 
marked by the oracle. 



3. Apply the oracle, which flips the ancilla qubit con- 
ditional on the other qubits being in the state cor- 
responding to the solution. 

4. Apply R® n+1 . 

5. Apply a phase shift to the data qubits conditional 
on not being in the state |0)®™, described by the 
unitary operator 2|0) (0|® n — I n where /„ is the iden- 
tity operation on the data qubits. 

6. Apply R® n . 

7. Repeat steps (3) to (6) a specified number of times, 
then measure the qubits in the computational basis. 

The number of repetitions (which is also the number of 
uses of the oracle) that maximizes the probability of ob- 
taining the correct answer is the nearest integer to 



2arccos v / (7V- l)/N 



(1) 



(Boyer et al. see also [l2|). This number is bounded 
above by \ir\fN /4] , hence the claim that Grover's algo- 
rithm uses 0{sfN) oracle calls, compared to the 0{N) 
oracle calls required in the classical case. 

For the remainder of this paper, we restrict our at- 
tention to the case where the number of elements in the 
search space is N = 4. In that case, the number of rep- 
etitions specified by Eq. Q is exactly one. A simplified 
circuit based on the algorithm described above is shown 
in Fig.n It can be verified directly that this circuit, using 
only one oracle call, gives the correct answer with proba- 
bility 1, compared to the average of 2.25 oracle calls that 
must be made with a classical circuit. For example, if 
the solution is 10, then the output of the circuit is a = 1 
and b = 0. 
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FIG. 2: The circuit on the left shows the beginning of the 
Grover circuit with an example oracle (inside the dashed box) 
marking the item 10. We have implemented the oracle using 
a variant of the Toffoli gate, where the state of the third 
qubit is flipped when the first two qubits are in the state |10), 
as indicated by the closed and open circles on the control 
qubits. We have moved the measurement on the third qubit 
forward since it plays no further role in the algorithm. In 
the text, we show that this circuit is in fact equivalent to the 
simplification on the right, where the Toffoli has been replaced 
by a controlled- Z (CSIGN) operation followed by an X on the 
appropriate qubit. 



B. Implementing the oracle 

An oracle is a quantum circuit that recognizes solutions 
to a given problem. For example, suppose we wish to 
solve a version of the traveling salesman problem, where 
the goal is to find a route visiting a given collection of 
cities that is shorter than some specified length L. Al- 
though it is in general hard to find such a route, it is 
easy to recognize whether a proposed route solves the 
problem: simply add up the total distance the salesman 
would travel on the proposed route, and compare it to L. 

Specifically, an oracle is a circuit that, given an input 
consisting of a potential solution to a problem, flips the 
sign of an ancilla qubit if and only if the input is a solu- 
tion to the problem. Since the only action of the oracle 
is to recognize solutions, its internal structure is unim- 
portant in a test of the algorithm itself. Thus, for our 
purposes, the choice of oracle is arbitrary, and may be 
chosen to be as simple as possible. 

Although the internal workings of the oracle are unim- 
portant for the purposes of testing the algorithm, the 
complexity of implementing some oracle must be in- 
cluded to characterize the difficulty of performing the ex- 
periment. A simple implementation of an oracle marking 
one of four states is a Toffoli gate, with the control qubits 
negated where necessary to specify any of the states 00, 
01, 10, or 11 (see the left-hand side of Fig. [21 for the ex- 
ample where the marked state is 10). 

If the marked state is 10, the action of the oracle on 
the three qubits is to take the state (|00) + |01) + 1 10) + 
|ll»(|0>-|l»to 



(|00) + |01) + |11))(|0)-|1)) + |10)(|1)-|0)) 
= (|00) + |01)-|10) + |11))(|0)-|1)) 



(2) 



(omitting the normalization) . Thus the oracle simply has 
the effect of flipping the sign of the marked state. The 
ancilla is not used again, so it can be discarded at this 
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FIG. 3: Inserting the simplified oracle of Fig. [5] into the 
circuit of Fig.Qgives this circuit. Note that the marked state 
is specified inside the oracle (the dashed box) by the values of 
x\ and X2 used to determine whether or not the X gates are 
applied. (Note that addition in the exponent of the X gates 
is modulo 2.) Under ideal circumstances, the output of the 
circuit is a — xi and b = ap- 



point. 

Toffoli gates are difficult to implement in LOQC be- 
cause there is no known way to implement one without 
using several CNOTs. However, for our purposes, a full 
Toffoli gate is not required because the ancilla qubit plays 
such a limited role. The two-qubit circuit on the right- 
hand side of Fig. |21 illustrates this for the case where the 
marked state is 10. A single controlled- Z (csign) gate 
that flips the sign of the |11) state, followed by X gates 
to move the minus sign to the appropriate state, has the 
same action as the original oracle. 

A simplified circuit to implement the four-element 
Grover algorithm is given in Fig-OJ This is the circuit 
that we will work with for the remainder of this paper. 



III. LOQC WITH POLARIZATION ENCODING 

In LOQC, qubits are encoded in dual rail logic pj: 
Two modes A and B are used, and logical |0) and |1) 
are encoded as |1)a|0)b and |0)^|1)b, respectively. The 
modes may represent two different spatial modes, or two 
different polarization modes of a single spatial mode. 2 

In practice, it is likely that polarization-encoded qubits 
will be used, so that logical |0) and |1) are encoded as \H) 
and \ V), respectively, where H and V refer to horizontal 
and vertical polarization one-photon states of the same 
spatial mode. The main reasons for this are (1) it signif- 
icantly simplifies the implementation of the CNOT gate 
(see below), (2) it allows one-qubit gates to be imple- 
mented using only waveplates and phase delays rather 
than beamsplitters and interferometers, and (3) it re- 
duces the effects of noise by ensuring that, unlike with 
spatial encoding, both states follow the same path on 
the quantum wires between gates. In this section we de- 
scribe in some detail the construction of one-qubit gates 
and CNOTs in polarization-encoded LOQC. 



2 A good introduction to LOQC in spatial encoding is pro- 
vided by a set of lectures of Knill, available online at 
http: //online . itp .ucsb . edu/online/qinf o01/ 
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A. One-qubit gates 

To our knowledge, no complete description of how to 
implement basic quantum gates with polarization encod- 
ing has been given in the literature, so we provide one 
here. For one-qubit gates, waveplates and phase delays 
are sufficient. A waveplate with slow axis \H') and fast 
axis W) has action 



those that are not. In this section, we describe both 
types. (Note that the CNOT and CSIGN gates are re- 
lated by conjugation by Hadamards on the target bit, i.e. 
CNOT = (1(g) R) CSIGN (I <g> R). Therefore, in the context 
of LOQC where one-qubit gates are relatively straight- 
forward, these two gates are practically equivalent, and 
we will use the two almost interchangeably.) 



\H') 
W) 



e**\H' 

W), 



(3) 



where <f> is the resulting relative phase difference. Spe- 
cial cases in common use are the half and quarter wave- 
plates, with <fi equal to half and a quarter of a wavelength, 
respectively. Now suppose \H') is rotated counter- 
clockwise (with respect to the direction of travel of the 
light) by an angle a from \H). If the input state is 
[J] = h\H) + v\V), then the output is given by 



e 1 ^ cos 2 a + sin 2 a {e 1 ^ — 1) cos a sin a 
(e 1 ^ — 1) cos a sin a e 1 ^ sin 2 a + cos 2 a 





h 




V 



(4) 



Special cases of this transformation for common one- 
qubit gates are set out in Tabled The Hadamard and 7r/8 
gates, labeled R and T in the table, are a universal set 
for one-qubit quantum computation (Boy kin et al. [l4j). 
and so any one-qubit gate can be obtained by a sequence 
of waveplates, although it is convenient to allow phase 
delays as well. In the Grover circuit, the only one-qubit 
gates used are the R, X, and Z gates, and thus we only 
require half waveplates. 



B. Two-qubit gates 

Since the publication of the original LOQC scheme 
of KLM 0, many simplifications of their CSIGN gate 
have been developed, with varying trade-offs between 
simplicity and functionality. The different types may 
be divided into two classes, those that are scalable and 



Gate 


Optical element 


e"J = e"[j;] 

x = [lh) 

*=[5-°i] 

r =[?-„*! 


phase delay of — 

waveplate with cf> — 180°, a = —67.5° 
waveplate with <j> — 45° , a = 90° 
waveplate with <f> — 180° , a — —45° 
waveplate with <j> — 180° , a = 90° 
2 waveplates and a phase delay: (e"' 4 /)IZ 



TABLE I: Various one-qubit gates and their implementation 
in polarization-encoded LOQC. a and <f> refer to the angle of 
the slow axis to the horizontal and the relative phase added 
to light parallel to the slow axis, respectively. Note that T 
requires a waveplate with a relative delay of one eighth of a 
wavelength. 



1. Scalable two-qubit gates 

The KLM scheme pj has two properties that at 
first appear contradictory: the LOQC CSIGN is non- 
deterministic, but it is used to do computations in a scal- 
able manner. The non-deterministic nature of the KLM 
CSIGN is essential to engineer a two-photon interaction 
without using highly nonlinear materials, but it poses a 
problem: if its success probability is e < 1, then the suc- 
cess probability of a circuit with n CSiGNs is e" — it de- 
creases exponentially with n. A solution to this problem 
is the technique of gate teleportation described by Nielsen 
and Chuang ^(| and Gottesman and Chuang [l7j]- This 
technique allows the gates to be prepared as an offline 
resource, and then "teleported in" whenever required for 
a computation. KLM showed that the teleportation step 
can be made near-deterministic using a sufficiently large 
number of repetitions. This technique is unlikely to be 
used in early experiments, however, because the extra dif- 
ficulty involved in teleporting gates will more than cancel 
out the advantages of increasing the success probability 
when the number of CSiGNs is small. 

An essential feature required to make this work is that 
it must be possible to determine when the gate has suc- 
ceeded. The KLM CNOT has this property — although 
it only succeeds 1 time in 16, whether or not it has suc- 
ceeded is determined by the outcomes of measurements 
on ancilla photons. We use the term scalable to describe 
a CSIGN (or CNOt) that has the property that it is known 
when it succeeds. 

In this paper, we will not work directly with the KLM 
CSIGN since there are simpler alternatives, such as the 
closely related simplification proposed by Ralph et al. il&h 
and the substantial modification proposed by Knill |18j . 
There is also a promising alternative approach using en- 
tangled ancillas discovered by Pittman, Jacobs and Fran- 
son [n| that we will not consider further here. We focus 
on the CSIGN of Ralph et al, shown in Fig. 



3 Recent numerical work by Lu nd, Bell, and Ralph L'(l . shows that 
the simplified KLM CSIGN of ll is more resilient to detector and 
ancilla inefficiencies than the other two, perhaps because it acts 
symmetrically on the two qubits. For example, the fidelity of 
this gate (calculated as the fidelity of the actual output with 
the ideal output, minimized over input states) is larger than the 
fidelities of the other two gates for detector efficiencies up to ap- 
proximately 95%. However, it remains to be seen what effects 
other sources of error, such as mode-matching errors, and imper- 
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control 



target 




FIG. 4: The simplified KLM CSIGN of |15|. The top rail contains the control qubit and the bottom rail contains the target qubit, 
both encoded in the polarization of a single photon. A square with a diagonal line across it represents a polarizing beamsplitter. 
By convention, we always assume that the horizontal polarization is 100% reflected while the vertical polarization is 100% 
transmitted. So, for example, after the first polarizing beamsplitters, the topmost rail contains the horizontally polarized 
component of the control qubit. A thin rectangle represents an ordinary beamsplitter, with a sign change for the mode 
reflected from the thick black side and reflectivity given by the cosine of the angle written next to it. (If the input modes to a 
beamsplitter are |o)i n and |6)m, with the b mode receiving the sign change and with reflectivity given by cos a;, then the outputs 
are cosa;|a)out +sina;|fe)out and sina;|a)out — cos a;|6) ou t.) The circuit uses two vertically polarized ancilla photons. It succeeds 
if the first two measurements both count photons and the second two measurements both count 1 photon. 
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FIG. 5: A further simplified (but still scalable) polarization-encoded KLM CSIGN. A gray rectangle containing u x°" represents a 
half waveplate with slow axis at an angle of x° to the horizontal polarization. See Table^Jfor the corresponding one-qubit gates. 
This circuit works similarly to the previous one (Fig. 2J, but it takes fuller advantage of the orthogonality of the polarization 
states. For a full description, see the text. 



In fact, there is a further, substantial simplification to 
this circuit that is achieved by making fuller use of the 
polarization encoding, resulting in the circuit in Fig. [3] 
This gate still requires two ancilla photons. However, it 
uses fewer detectors, beamsplitters, and polarizing beam- 
splitters, and eliminates two interferometers. Its effect on 
qubit states is unchanged, up to an unimportant overall 
phase of —1. If we denote the beamsplitter reflectivities 
as ?7i = 5 — 3\/2 and 772 = (3 — \/2)/7 (which are approx- 
imated as cos 40.8° and cos 76.9° in the diagram), then 



the action of the gate is the following: 



|00) 
|01) 
|10) 

111) 



V^(2r7 2 -l)|00> = -Vp|00) 
r?i(3r ? |-27 ?2 )|01) = -^|01) 
Vm^(2r/2-l)|00) = -Vp|10) 
»fc|ll> = Vflll) 



(5) 



(11- 



where the success probability p is given by p = rfo 
6a/2)/49 ps 0.05. Thus the gate works approximately 1 
out of every 20 attempts. For the remainder of this paper, 
we will refer to this gate simply as a "scalable CSIGN" . 



feet beamsplitter reflectivities, will have on the relative merits of 
each gate. 



2. Coincidence-basis two-qubit gates 

An even simpler, but non-scalable CNOT was discov- 
ered by Hofmann and Takeuchi [2lJ and Ralph et al. ■ 
It succeeds 1 time in 9, but it only works in the coinci- 
dence basis, i.e. when the results of the whole compu- 



6 



control ■ 



54.7° 



54.7° 









<\54.7° 








-67.5° 














-67.5° 




/ 




s 





FIG. 6: The coincidence-basis cnot of HI], All three 
beamsplitters have the same reflectivity 1/3 « cos 54.7°. It 
can be turned into a CSIGN by removing the two halfwavc 
plates. Note that it is not necessary to have detectors on the 
reflected modes of the topmost and bottommost beamsplitters 
(even though measuring a photon in either of these modes 
would signal a failure), since other failures of this gate are 
undetectable until the end of the computation. The gate has 
worked if exactly one photon is found in each rail. 



tation are selected to contain an allowed distribution of 
photons among detectors. We call this a coincidence- 
basis CNOT. See Fig. |SJ This circuit has been designed 
so that if exactly one photon is measured in the top rail 
(in either polarization) , and one in the bottom rail, it has 
worked with certainty. Otherwise, the result is discarded 
and the experiment is repeated. It cannot in general be 
followed by further two-qubit gates, as it is possible for 
a later gate to mask a failure. Thus it cannot be used to 
do scalable quantum computation. 

The useful purpose served by this gate (as well as the 
coincidence-basis gate of @) is as a simpler intermediate 
step before the full complexity of a scalable cnot. In a 
general circuit, it may be possible to replace one or more 
scalable CNOTs with a coincidence-basis CNOT, thereby 
significantly reducing the complexity of circuits contain- 
ing a few CNOTs. In the following sections on construct- 
ing optical circuits to perform the four-element Grover 
algorithm, we will see some of these ideas in action. 



IV. THE TWO-QUBIT GROVER IN LOQC 

A simplified circuit for the four-element Grover algo- 
rithm was given in Fig. [3] In Fig.[7| this circuit is trans- 
lated directly into an optical circuit, using the prescrip- 
tions and circuits of the previous section. 

The circuit, which succeeds one time in approximately 
180 = 20 x 9 (the product of the number of attempts 
per success for each cnot), uses 10-12 half waveplates, 4 



5 beamsplitters (2 of which must be modematched), 
9 polarizing beamsplitters (4 of which must be mode- 
matched), 4 photons that must be simultaneously pro- 
duced in desired polarization states, and 7 single-photon 
detectors. The second CNOT can be done in the coinci- 
dence basis since there are no interactions between the 
two qubits following it. Therefore, if the final measure- 
ment contains an allowed distribution of photons (exactly 
one in the top two detectors and one in the bottom two 
detectors), we know that the second CNOT worked, which 
is sufficient for our purposes here. 5 

However, it is important to note that the output of this 
circuit (before the measurement) could not be used to do 
further calculations because of the uncertainty in the out- 
come of the second CNOT. If, for example, there were two 
photons in the top rail after the second CNOT, the sys- 
tem's state would no longer be in the "qubit space". A 
third CNOT might bring the system back into the qubit 
space, but it is unlikely to have performed the transfor- 
mation we expected. In this case, the overall circuit fails, 
but we have no way of detecting the failure (except to 
compare with the answer that we can calculate by hand 
for this simple case). 

To ensure reliability for further calculations, the sec- 
ond CNOT should be replaced by a scalable CNOT. The 
optical circuit for this case would work one time in 400, 
and would contain on the order of 14-16 waveplates, 
8 polarizing beamsplitters (6 of which would be mode- 
matched), 4 ordinary beamsplitters (2 of which would 
be modematched), 6 photons produced in desired polar- 
ization states simultaneously, and 10 single-photon detec- 
tors. This would be considerably more difficult to achieve 
experimentally. Since we are (in principle) guaranteed to 
be in the qubit space at the end of this circuit, the out- 
put of each pair of detectors should contain exactly one 
photon. Therefore, it is possible to simplify the final 
detection process by simply blocking out one of the po- 
larizations (horizontal, say), and then looking to see if a 
photon is detected. This would reduce the number of po- 
larizing beamsplitters to 6 and the number of detectors 
to 8, at the cost of introducing two polarization filters. 
However, in practice the number of photons at the out- 
put will sometimes be incorrect. Thus, the increase in 
simplicity would have to be weighed against the failures 
that would go undetected. 



4 Note that the 90° and 67.5° halfwave plates cannot be combined 
into a single waveplate: their product [J ~* ] has terms of 
opposite sign in the off-diagonal terms, while the waveplate equa- 



tion (Eq. J1J) has these entries equal. 
5 A small but potentially useful sim plifi cation is to remove the 
40.8° beamsplitter, as described in |20H , They show that, until 
detector and source efficiencies of up to approximately 99.5% are 
reached, the fidelity of the gate can be substantially increased 
by removing this beamsplitter and adjusting the reflectivity of 
the 76.9° beamsplitter. Given that beamsplitter reflectivities 
are imperfect, removing this beamsplitter is likely to decrease 
that source of error, while also decreasing the complexity of the 
circuit by removing a detector. There is a catch, however: the 
probability of success decreases by a factor of 4—5 for efficiencies 
of 80%-95%. 
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FIG. 7: An optical implementation of Grover's algorithm on four elements based on the circuit in Fig. [3] The oracle part is 
contained in the dashed box. This circuit is essentially the concatenation of the circuits for the scalable CSIGN (Fig. and 
the coincidence-basis CNOT (Fig. 0, together with a few extra waveplates. The output of the circuit is discarded unless the 
first measurement counts photons, the second two measurements both count 1 photon, and one photon is found in each 
pair of detectors at the end, i.e. an + av = &ff + bv = 1. Note that we have omitted the final correcting NOT gate on the 
classical output in this diagram, but it should still be done. For example, if the oracle marks state 10, then the algorithm has 
successfully identified the marked state if measurements return an = 1, av = 0, bn = 1, bv = 0. 



V. SIMPLIFICATIONS 

By far the most difficult aspect of the experiments just 
described is implementing the scalable CSiGNs. However, 
the CSIGN in the oracle is only used in a very restricted 
way, and it turns out that we can replace it with a much 
simpler circuit. Since only one input state is ever used, 
namely (\H) + \V))(\H) + |V)), only one state is ever out- 
put from the CSIGN , namely | HH) + \ VH) + \HV) - \ VV) . 
(We will continue to neglect normalization constants.) If 
a source of entangled input states were available, then the 
CSIGN could be replaced. In optics, such a source is in fact 
readily available: a parametric down conversion source 
can be used to produce the state \HH) + \ VV), which 
can be converted into our desired state by a Hadamard 
gate on the first qubit, \H) -> \H) + \V), \V) -> \H)-\V). 
Using this fact, a much simplified version of Grover's al- 
gorithm is presented in Fig.|S] 

The simplicity of this circuit compared with the previ- 
ous one is emphasized by comparing the number of com- 
ponents. This circuit works one in every 9 attempts, 
and requires 6-8 waveplates, 6 polarizing beamsplitters 
(of which 2 must be modematched) , 3 ordinary beam- 
splitters (1 of which must be modematched), 2 photons 
which are produced as the output of a parametric down- 
conversion source, and 4 single-photon detectors. 

What have we traded for this enormous gain in sim- 
plicity? It turns out that we have compromised the ver- 
satility of the algorithm. Most significantly, the oracle 
is no longer easily replaceable. In principle, the oracle 
should be a "' plug- in" component able to have many dif- 
ferent forms corresponding to different potential prob- 
lems. In this simplified scheme, however, we have ob- 
scured the line between the oracle and non-oracle parts 
of the circuit, making it difficult to see how to make the 
circuit solve a problem using a different oracle. In Fig. 03 
a dashed box outlines the "oracle" part of the circuit for 
comparison with the previous diagrams, but there is in 
fact no clear line dividing the oracle from the earlier part 
of the circuit. 



This change affects how the circuit could be used. One 
example is demonstrating the variation in the success 
probability of Grover's algorithm as a function of the 
number repetitions of steps (3)-(6) described in SeclTTI 
In the circuit in Fig.0 the oracle can be reused with some 
small changes. 6 On the other hand, in Fig. [SI this is not 
possible — the "oracle" can only be used once 7 . 



VI. FIGURES OF MERIT 

An important question that has so far not been ad- 
dressed is what the appropriate figures of merit are for 
this experiment. There are two related but distinct no- 
tions of success here. The first is to what extent the 
actual goal of Grover's algorithm has been achieved, i.e. 
how successfully the experiment distinguishes between 
the four different oracles. The second is how similar the 
actual operation of circuit is to the ideal operation. This 
second notion is important for using these experiments 
as tests of the ability to combine the basic elements of 



6 The oracle on the right-hand side of Fig. [5] is designed to work 
with inputs that are equal superpositions of computational basis 
states. If the oracle is used twice in the same circuit, then it is 
unlikely that the input state will always be the same. In order to 
make the oracle work for an arbitrary input state, it is necessary 
to simply duplicate the X gates following the CSIGN, before the 
CSIGN. For the example in Fig. [5] where the oracle marks the 
state |01), the oracle should consist of the following: an X gate 
acting on the bottom qubit, followed by the CSIGN, followed by 
the X acting on the bottom qubit. 

7 For a more speculative example, Grover's algorithm can be used 
to obtain upper bounds on an entanglement monotone called 
the Groveria n e ntanglement, as described by Biham, Osborne 
and Nielsen l2.il . The basic idea is that if an n-qubit state p 
(possibly mixed) is used as input rather than |0)® n , the square 
root of 1 minus the success probability gives a good measure of 
the entanglement of p. This application requires input states 
with varying degrees of entanglement, and thus is not possible 
in the simplified circuit. 
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FIG. 8: Grover's algorithm using a parametric down conversion input. This circuit works similarly to the previous one, but 
the oracle is no longer demarcated from the initial part of the circuit. The dashed box in this figure now contains both the 
oracle and the initialization to the state \HH) + \ VV). The advantage of this circuit is that it makes use of a natural source of 
optical entanglement (parametric down conversion) to replace the very difficult scalable CSIGN. The outputs from this circuit 
are accepted under the same conditions as the previous circuit (clb + ay — bn + by = 1), and the final classical NOT has again 
been omitted. 



quantum computation. It is clearly related to the first 
— if the experiment cannot reliably distinguish between 
the oracles, then the actual behavior of the circuit must 
be very far from the ideal operation. 

In order to be able to compare experiments (and also 
to optimize the performance of a particular experimen- 
tal setup), we need to be more precise about how to 
measure the success of these experiments. We suggest 
calculating figures of merit reflecting each of the two no- 
tions of success described above. The first is simply to 
measure the distinguishability of the distribution of mea- 
surement results output by the circuit for different ora- 
cles. For example, suppose that for the oracle marking 
the state 00, the results 00, 01, 10, and 11 occur with 
probabilities p 00 = {0.9,0.04,0.02,0.04}, while the cor- 
responding results when the oracle marks state 10 are 
pio = {0.01,0.08,0.8,0.11}. A simple indicator of the 
distinguishability of these two distributions is their fi- 
delity 

F(poo,pio) = ^ \/poo{x)pio(x) (6) 

X 

where x ranges over the measurement outcomes 
00, . . . , 11 and p a b(x) is the probabilities of obtaining re- 
sult x given that the oracle marked state ab. This quan- 
tity has the property that it is 1 precisely when the two 
distributions are identical and precisely when the two 
distributions are non-overlapping, that is, when the set 
of results for which the first distribution is non-zero has 
no elements in common with the set of results for which 
the second distribution is non-zero. 

In the context of Grover's algorithm, it is desirable to 
make the fidelity between the distributions arising from 
each pair of oracles as large as possible. (For an intro- 
duction to the fidelity, see, for example, 0,13- The re- 
lationship of the fidelity to distinguishability is explored 
in Wootters ^ and |2^.1 

The second figure of merit is related to the similar- 
ity of the actual operation implemented £ to the desired 



unitary U. U is obtained by simply multiplying together 
the circuit elements in Fig. EI £ , on the other hand, must 
be determined experimentally. Ideally, £ should be de- 
termined precisely using a method such as quantum pro- 
cess tomography (Chuang and Nielsen [26| and Poyatos, 
Cirac, and Zoller )• Although process tomography 
can be done using only product-state inputs and one- 
qubit measurements, it requires an enormous number of 
runs of the experiment since the output states resulting 
from 16 different input density matrices must be deter- 
mined via quantum state tomography. 

A less stringent, but much more easily calculated, cri- 
terion is that the probability distributions for each oracle 
should be close to the ideal distributions. Again, it is de- 
sirable to have the fidelity of the actual distribution to 
the ideal distribution for each oracle as close to 1 as pos- 
sible. 8 



VII. A HIERARCHY OF EXPERIMENTS 

This collection of different implementations of the 
same algorithm could be used as the basis for a series of 
experiments, each building on the last, each more compli- 
cated than the last, each demonstrating improved quan- 
tum control. For example, once a basic coincidence-basis 
CNOT is working, it would be relatively simple to add 
a small number of waveplates and a source of entangled 
photons to do the circuit in Fig-JSJ Once a scalable CNOT 
is achieved, these two different CNOT circuits could be 
combined to do the more complicated implementation 
of Grover's algorithm in Fig.0 demonstrating the abil- 



Knill et al. 1281 have a useful discussion of these issues where 
they advocate the entanglement fidlelity to measure the quality 
of an experimental implementation of the five-qubit code. They 
describe a simple way of measuring the entanglement fidelity that 
could be easily generalized to the setting of Grover's algorithm. 



9 



ity to combine a scalable CNOT with further non-trivial 
quantum computations. Finally, in the more distant fu- 
ture, the implementation using two scalable CNOTs would 
make a good testing ground for techniques for combining 
LOQC components. 
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